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Abstract 

Background: Metabolic resistance to insecticides is the biggest threat to the continued effectiveness of malaria 
vector control. However, its underlying molecular basis, crucial for successful resistance management, remains 
poorly characterized. 

Results: Here, we demonstrate that the single amino acid change L1 19F in an upregulated glutathione S-transferase 
gene, GSTe2, confers high levels of metabolic resistance to DDT in the malaria vector Anopheles funestus. Genome-wide 
transcription analysis revealed that GSTe2 was the most over-expressed detoxification gene in DDT and permethrin- 
resistant mosquitoes from Benin. Transgenic expression of GSTe2 in Drosophila melanogaster demonstrated that 
over-transcription of this gene alone confers DDT resistance and cross-resistance to pyrethroids. Analysis of G5Te2 
polymorphism established that the point mutation is tightly associated with metabolic resistance to DDT and its 
geographical distribution strongly correlates with DDT resistance patterns across Africa. Functional characterization 
of recombinant GSTe2 further supports the role of the L1 19F mutation, with the resistant allele being more efficient at 
metabolizing DDT than the susceptible one. Importantly, we also show that GSTe2 directly metabolizes the pyrethroid 
permethrin. Structural analysis reveals that the mutation confers resistance by enlarging the GSTe2 DDT-binding cavity, 
leading to increased DDT access and metabolism. Furthermore, we show that GSTe2 is under strong directional 
selection in resistant populations, and a restriction of gene flow is observed between African regions, enabling the 
prediction of the future spread of this resistance. 

Conclusions: This first DNA-based metabolic resistance marker in mosquitoes provides an essential tool to track the 
evolution of resistance and to design suitable resistance management strategies. 



Background 

Insecticide-based control interventions, notably long last- 
ing insecticide nets (LLINs) and indoor residual spraying 
(IRS), are key components of malaria control in Africa. 
Unfortunately, increasing resistance to available insecti- 
cide classes across Africa in major malaria vectors, such 
as the mosquito Anopheles funestus, is threatening the 
continued effectiveness of these control tools. Elucida- 
ting the molecular basis of insecticide resistance in 
these vectors is crucial to designing suitable resistance 
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management strategies and preventing potentially dev- 
astating public health consequences [1]. 

Pyrethroids are the only class of insecticide used for 
LLIN impregnation, and are also the predominant in- 
secticide class used in IRS, while dichlorodiphenyltri- 
chloroethane (DDT) is still retained for use in IRS, due 
to the limited number of cost-effective alternatives. The 
two major causes of pyrethroid and DDT resistance are 
target-site insensitivity and metabolic resistance [2]. Target- 
site resistance (knockdown resistance (kdr)) can be easily 
monitored by PCR, while metabolic resistance is not easily 
tracked due to its complex molecular basis, despite its 
greater operational impact on malaria control [1]. The de- 
tailed molecular and structural basis through which candi- 
date metabolic resistance genes confer resistance remains 
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poorly characterized [3-5]. No single metabolic resistance 
marker has been identified in malaria vectors. Conse- 
quently, there is still no DNA-based diagnostic tool to de- 
tect metabolic resistance easily in field populations unlike 
target site resistance. Such tools are needed to detect and 
track resistance at an early stage allowing control programs 
to design rational, evidence-based resistance management 
strategies. 

To address this gap in our knowledge, we dissected 
the molecular basis of metabolic resistance in a DDT/ 
pyrethroid-resistant An. funestus population from Benin 
that is kdr-free [6]. Using genome-wide transcriptional 
and functional analyses combined with structural and 
population genetics studies, we conclusively demonstrated 
that a single amino acid change in the binding pocket 
of the glutathione-s-transferase epsilon 2 (GSTe2) gene, 
coupled with increased transcription, confers a high level 
of DDT resistance and also cross-resistance to pyrethroids 
in the major African malaria vector An. funestus. 

Results and discussion 

DDT resistance profile of Anopheles funestus in Benin and 
Cameroon mosquitoes 

The Pahou population (Benin) had previously been de- 
scribed as highly DDT resistant [6] with no mortality 
24 hr after 1 hr of exposure. The WHO bioassays con- 
ducted in Kpome (Benin) in this study indicated that this 
An. funestus population, which is located approximately 
100 km from Pahou, was also resistant to DDT, with 
only 5% mortality 24 hr after 1 hr of exposure to 4% 
DDT for females and 21% mortality for males. The 
population from Gounougou in Cameroon was also re- 
sistant to DDT, but at a moderate level, with 52% mor- 
tality for females and 65% for males. 

Genome-wide transcription microarray analysis 

A comparative genome-wide transcription analysis using 
the 44,000-probe (60-mer) custom microarray chip pre- 
viously designed by Riveron et al. [5] was successfully 
used to identify the set of genes associated with DDT re- 
sistance in Pahou (Benin). A total of 1,352 probes were 
differentially expressed (> twofold at P<0.01) between 
the DDT-resistant samples from Pahou and the suscep- 
tible strain FANG, with 321 upregulated and 1,031 
downregulated in the Pahou samples compared with the 
susceptible strain FANG (Additional file 1: Figure SI A). 
The lists of the top probes that were differentially upreg- 
ulated or downregulated are presented in Table 1 and 
Additional file 2: Table SI respectively. 

The most upregulated detoxification gene in Benin was 
a glutathione S -transferase gene, GSTe2, with a fold- 
change (FC) of 11.9 (P = 0.0076) (Table 1 and Additional 
file 3: Table S2; Additional file 1: Figure S1A). The 
consistency of this upregulation is further supported by 



the fact that both probes designed for GSTe2 were upregu- 
lated in the Pahou population (Table 1). Furthermore, 
orthologs of GSTe2 were also upregulated in DDT- 
resistant strains in other mosquito species such as An. 
gambiae [3,7] and Aedes aegypti [8], suggesting that this 
gene most likely plays a key role in DDT resistance in 
many mosquito species. 

The two duplicated cytochrome P450 genes, CYP6P9a 
(FC = 6.4) and CYP6P9b (FC = 3.9), which confer pyreth- 
roid resistance in southern African populations [5], were 
also upregulated in the Benin population. However, be- 
cause their encoded proteins are unable to metabolize 
DDT [5], they are most likely associated with the per- 
methrin resistance observed in the Pahou mosquitoes. 
Other genes with a known association with insecticide 
resistance were also upregulated in Pahou mosquitoes 
and are detailed in Additional file 4 Supplementary text 
and listed in Table 1. 

Validation of the microarray upregulation with 
quantitative RT-PCR 

Quantitative reverse-transcriptase PCR (qRT-PCR) con- 
firmed the significant upregulation of GSTe2 in Benin 
(FC = 44.8, P = 0.007) (Additional file 1: Figure SIB) in 
comparison to the FANG susceptible strain. The GSTe2 
gene is significantly over-transcribed in DDT-exposed 
mosquitoes compared to unexposed mosquitoes (FC = 82.0 
vs 44.8, P < 0.05) (Figure 1A), suggesting that induction of 
GSTe2 occurs in addition to constitutive over-expression 
in resistant mosquitoes. The expression pattern of GSTe2 
across Africa significantly correlates with DDT-resistance 
patterns. This gene is 69.5- and 79.1 -fold upregulated in 
highly DDT-resistant mosquitoes from Benin (P < 0.05) 
compared with the fully DDT-susceptible Mozambique 
and Malawi samples, respectively (Figure 1A). It is 35-fold 
upregulated (P<0.05) compared to Ugandan samples 
(moderate DDT resistance [9]). Similarly, the two P450s 
CYP6P9a and CYP6P9b had 2.96 and 7.13 times more ex- 
pression in mosquitoes from Pahou than in the susceptible 
FANG mosquitoes. 

Overall, transcription analyses indicated that GSTe2 is 
the main detoxification gene associated with DDT resist- 
ance in Pahou mosquitoes. In other mosquito species, 
DDT resistance is conferred by additional mechanisms, 
such as the knockdown resistance (kdr) mutation (An. 
gambiae [10] and Aedes aegypti [8]), or by upregulation 
of cytochrome P450s, as observed for An. gambiae for 
CYP6Z1 [11] and CYP6M2 [12]. None of these mecha- 
nisms are present in An. funestus because no kdr muta- 
tion was detected in this population [6], and the only 
upregulated P450s in this population (CYP6P9a and 
CYP6P9b) are unable to metabolize DDT [5]. These re- 
sults indicate that GSTe2 is the main detoxification gene 
associated with DDT resistance in this Benin population 
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Table 1 Upregulated genes in DDT-resistant Benin population of Anopheles funestus after microarray 



Gene families 



Gene name 



Orthologs in 
Anopheles gambiae 



Corrected 
P value 



Fold- 
change 



Log 2 

fold-change 



Description 



Glutathione 
S-transf erases 

Cytochrome P450s 



Carboxylesterases 

Short-chain 
dehydrogenases 



Cuticle proteins 



Odorant binding 
proteins 



Proteases 



Transporters 



Heat shock proteins 



Salivary proteins 



UDP (uridine 
diphosphate) 
glucosyltransferases 

Other functions 



combined_c920 

combined_c920 

CYP6P9a 

CYP6P9b 

gb-C0EAE1G 

combined_c738 

AGAP008125-RA 
AGAP001405-RA 
combined_c1061 
AGAP001405-RA 
AGAP005999-RA 
AGAP006867-RA 

AGAP010906-RA 
combined_c1722 
combined_c1722 
AGAP006867-RA 

AGAP001012-RA 

combined_c1773 

combined_c1773 

AGAP009390-RA 

AGAP004857-RA 

gb-SP8905 

AGAP006674-RA 

combined_c1563 

AGAP004904-RA 

CD578028.1 

combined_c1762 

combined_c3512 

AGAP012626-RA 

combined_c4173 

combined_c4173 

AGAP002076-RA 

EE589639.1 
EE590018.1 
EE589541.1 
combined_c121 1 

combined_c1691 
AGAP003209-RA 



AGAP009194 
AGAP009194 
AGAP002865 
AGAP002865 
AGAP006700 
AGAP005166 

AGAP008125 
AGAP001405 
AGAP011852 
AGAP001405 
AGAP005999 
AGAP006867 

AGAP010906 
AGAP010122 
AGAP010122 
AGAP006867 

AGAP001012 

AGAP006368 
AGAP006368 
AGAP009390 
AGAP004857 
AGAP003640 
AGAP006674 
AGAP003581 
AGAP004904 
AGAP006485 

AGAP006364 
AGAP006186 
AGAP012626 
AGAP001424 
AGAP001424 
AGAP002076 

AGAP008281 
AGAP008216 
AGAP008283 
AGAP007920 

AGAP013141 
AGAP003209 



0.0076 
0.0047 
0.0091 
0.0078 
0.0071 
0.0016 

0.0057 
0.0033 
0.0079 
0.0035 
0.0048 
0.0034 

0.0038 
0.0061 
0.0031 
0.0037 

0.0038 

0.0064 
0.0072 
0.0082 
0.0036 
0.0057 
0.0059 
0.0039 
0.0056 
0.0081 

0.0094 
0.0039 
0.0047 
0.0021 
0.0049 
0.0033 

0.0063 
0.0022 
0.0086 
0.0030 



11.9 
8.8 
6.4 
3.9 
5.1 
26.0 

5.3 
3.8 
3.7 
3.7 
10.8 
5.2 

3.6 
5.0 
4.8 
2.9 



2.7 
2.7 
2.1 
5.2 
4.0 
3.4 
2.3 
4.5 
3.8 

2.8 
2.2 
6.9 
11.6 
9.6 
2.7 

3.0 
2.2 
2.1 
2.1 



0.0041 34.4 
0.0083 5.0 



3.6 Glutathione S-transferase gste2 
3.1 Glutathione S-transferase gste2 

2.7 Cytochrome P450 
2.0 Cytochrome P450 
2.4 Alpha-esterase 

4.7 Short-chain dehydrogenase 

2.4 Short-chain dehydrogenase 

1.9 Short-chain dehydrogenase 

1.9 Short-chain dehydrogenase 

1.9 Short-chain dehydrogenase 

3.4 Pupal cuticle 

2.4 Adult-specific cuticular 
protein acp-20 

1.9 Cuticle protein 

2.3 Pupal cuticle protein 

2.3 Pupal cuticle protein 

1.5 Adult-specific cuticular 
protein acp-20 

3.0 Candidate odorant receptor 

1.5 Odorant binding protein 4 

1.4 Odorant binding protein 4 

1.1 Olfactory receptor 

2.4 Clip-domain serine proteinase 

2.0 Prolylcarboxypeptidase 

1.8 Serine protease 

1.2 Chymotrypsin-like protein 

2.2 Catalase 

1.9 Serine threonine-protein 
kinase riol 

1.5 ABC transporter 

1.1 Calcium-transporting ATPase 
2.8 Serotonin transporter 

3.5 Glycoprotein 93 

3.3 Glycoprotein 93 

1.4 Heat shock protein 
cognate isoform a 

1.6 d7-related 1 protein 
1.1 gsg7 salivary protein 
1.1 d7-related 3 protein 

1.1 Glucosylglucuronosyltransferases 



5.1 Mediator complex 

2.3 Sterol desaturase 
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Table 1 Upregulated genes in DDT-resistant Benin population of Anopheles funestus after microarray (Continued) 



combined. 


_c3636 


AGAP001423 


0.0031 


3.7 


1.9 


Bifunctional purine 
biosynthesis protein 


combined. 


_c3636 


AGAP001423 


0.0040 


3.6 


1.8 


Bifunctional purine 
biosynthesis protein 


combined. 


_c8340 


AGAP003581 


0.0047 


2.7 


1.4 


Sorbitol dehydrogenase 


combined. 


_c1577 


AGAP000523 


0.0047 


2.1 


1.1 


ATP synthase lipid-binding 
mitochondrial precursor 



of An. funestus. Therefore, further analysis of this gene 
would be very good for elucidating metabolic resistance 
mechanisms and for understanding the detailed molecular 
basis through which it confers resistance to insecticides. 

Transgenic expression of GSTe2 in Drosophila flies 

To establish whether the upregulation of GSTe2 alone 
can confer DDT and pyrethroid resistance, transgenic 
Drosophila melanogaster flies expressing GSTe2 cloned 
from Benin were generated using the GAL4/UAS system 
under the ubiquitous Act5C-GAL4 driver (Act5C-GSTe2). 



After confirming the expression of GSTe2 in transgenic 
F x progeny by qRT-PCR (Additional file 1: Figure SIC), 
bioassays revealed that the transgenic Act5C-GSTe2 flies 
were resistant to 4% DDT (19.1% mortality after 24 hr 
exposure), whereas, all the controls that did not over- 
express GSTe2 were susceptible (83.5% to 97.9% mortality, 
P < 0.0001) (Figure IB). These results indicate that GSTe2 
upregulation alone is sufficient to confer DDT resistance. 
Such a transgenic expression approach was also recently 
successfully used to confirm the involvement of two P450s 
(CYP6P9a and CYP6P9b) in conferring pyrethroid 




GAL4-Actin ^— Exp-Gste2 Gal4-Actin ^^EXP-GSTe2 




[DDT] jiM Permethrin +GSTe2 

-* L119-GSTe2 119F-GSTe2 



Figure 1 GSTe2 expression and functional analysis. (A) Comparative qRT-PCR examining DDT-resistant (Benin) and DDT-susceptible (Mozambique, 
Malawi and Uganda) mosquitoes. (B) DDT bioassay tests on transgenic Act5C-GSTe2 flies (Exp-GSTe2) and control strains (two parental (UAS-GSTe2 and 
GAL4-actin) and Ft progeny that do not express the GSTe2 transgene (Cont-NO)). (C) The same bioassays with permethrin. (D) DDT metabolic activity 
(depletion rate after 1 hr) of GSTe2 alleles (mean ± standard deviation). (E) Michaelis-Menten enzyme kinetics for resistant and susceptible GSTe2 alleles 
(F) Permethrin metabolic activity (depletion rate after 1 hr) for the 1 19 F GSTe2 allele (mean ± standard deviation). DDT, dichlorodiphenyltrichloroeth- 
ane; MAL, Malawi; MOZ, Mozambique; qRT-PCR, quantitative reverse transcriptase polymerase chain reaction; UG, Uganda, DDE, dichlorodiphenyldi- 
chloroethylene. 
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resistance in An. funestus [5] and previously in Drosophila 
[13], indicating the usefulness of this technique. 

Noticeably, the upregulation of GSTe2 also conferred 
cross-resistance to pyrethroids. Indeed, bioassays per- 
formed with 2% permethrin revealed that the transgenic 
Act5C-GSTe2 flies (35.6% mortality after 24 hr expos- 
ure) but not the controls (78.9% to 97.2% mortality, 
P < 0.001) (Figure 1C) were permethrin resistant. A 
moderately reduced mortality rate (P<0.05) was ob- 
served in transgenic flies after 0.15% deltamethrin ex- 
posure only after 24 hr (Additional file 1: Figure SID). 
However, the mortality rate obtained for pyrethroids was 
higher than for DDT, suggesting that the resistance con- 
ferredto pyrethroids by GSTe2 is lower than to DDT. 
Such observations are in accordance with the resistance 
profile of the Pahou population, which is highly resistant 
to DDT but only moderately resistant to permethrin [6]. 
This potential role of GSTe2 in permethrin resistance is 
in agreement with previous reports that suggested that 
orthologs of GSTe2 in other mosquitoes are associated 
with pyrethroid resistance. Indeed, microarray analyses 
of An. gambiae [3] reported upregulation of GSTe2 in a 
permethrin-resistant strain, with the suggestion that the 
protein for this gene could be involved in permethrin re- 
sistance either by acting as a pyrethroid-binding protein 
and sequestering the insecticide [14] or by protecting 
mosquitoes against the oxidative stress and lipid peroxi- 
dation caused by exposure to permethrin [15]. In the 
dengue fever mosquito A e. aegypti, a partial knockdown 
of the ortholog of GSTe2 led to increasing mortality to 
permethrin, indicating that GSTe2 is also associated with 
permethrin resistance in this species [8]. This cross- 
resistance to pyrethroids is of significant concern for 
malaria control as GSTe2 could protect mosquitoes 
against the major insecticides used in public health. 

Detection of resistance markers in GSTe2 

To detect resistance markers for GSTe2, we analyzed 
the full-length cDNA (666 bp) from resistant Benin 
mosquitoes, and four susceptible mosquito strains from 
Uganda, Malawi, Mozambique and Zambia. There were 
19 polymorphic sites (8 replacement substitutions) from 
24 clones. The major difference in the Benin strain was 
a fixed leucine (CTT) to phenylalanine (TTT) replace- 
ment (L119F). The Benin haplotypes form a unique 
clade compared with the haplotypes from the other 
countries (Figure 2A). 

Analysis of the full-length GSTe2 (881 bp including in- 
trons) of six resistant and six susceptible mosquitoes 
from another Benin location (Kpome, 6°55'N, 2°19'E, 
because no susceptible mosquitoes were available from 
Pahou) confirmed that the GSTe2 polymorphism was 
associated with DDT resistance. No polymorphisms 
were identified for the resistant mosquitoes and there 



was a single haplotype (Hapl-R) bearing the resistant 
119 F allele (Figure 2B,C; Additional file 3: Table S2). 
Five heterozygote polymorphic sites were identified for 
susceptible mosquitoes with a single amino acid change, 
L119F. The presence of a susceptible haplotype Hap2-S 
bearing the LI 19 susceptible allele was also observed 
(Figure 2B,C). 

Correlation between the L119F mutation and DDT and 
pyrethroid resistance 

The TaqMan assay, designed to genotype the L119F 
mutation, unambiguously detected the three genotypes 
(Additional file 5: Figure S2A) in 35 susceptible and 35 
resistant mosquitoes from Gounougou (9°05'N, 13°40'E) 
in Cameroon (west-central Africa) (as very few suscep- 
tible mosquitoes were obtained from Benin). The distri- 
bution of the genotype frequencies between susceptible 
and resistant mosquitoes significantly differed (x 2 = 99.7, 
degrees of freedom = 2 and P < 0.0001) (Figure 3B). A 
significant association was observed between the L119F 
mutation and DDT resistance, with an odds ratio of 5.74 
(P < 0.0001) when comparing the frequency of the resist- 
ant and susceptible alleles in both sample sets. When 
comparing the frequency of the two homozygous geno- 
types (T/T and C/C), the T/T mutant genotype was more 
highly associated with DDT resistance than the C/C wild 
genotype with an odds ratio of 22.3 (P < 0.0001). Indeed, 
72.5% of the mosquitoes with the homozygote mutant al- 
lele (T/T) were resistant to DDT whereas approximately 
half of the heterozygote mosquitoes were resistant to 
DDT (54.5%), and only 10% of the homozygote wild-type 
(C/C) were resistant to DDT, suggesting co-dominance of 
this allele (Figure 3B). The correlation of the L119F muta- 
tion with DDT resistance indicates that this TaqMan assay 
could be used as a diagnostic tool to detect and map the 
distribution of DDT resistance in An. funestus in Africa. 

This is the first report to our knowledge of a point 
mutation conferring metabolic resistance in a mosquito 
species. To date, metabolic resistance to DDT has only 
been associated with a single haplotype of the P450 gene 
CYP6G1 in D. melanogaster [16] or to a potential allelic 
variation in the ortholog of GSTe2 in Ae. aegypti [8] but 
not to a single point mutation as detected in this study 
in An. funestus. Such a point mutation conferring meta- 
bolic resistance has previously only been described for 
the house fly (Musca domestica) for the alpha esterase 
E7 gene [17] and for the sheep blowfly (Lucilia cuprina) 
for the carboxylesterase E3 gene [18]. Detection of the 
L119F mutation in An. funestus, which confers high 
DDT resistance, indicates that similar point mutations 
conferring resistance to other insecticide classes could 
also be found in mosquitoes, notably in malaria vectors, 
allowing the design of reliable molecular diagnostic assays 
that can accurately detect and track metabolic resistance 
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Figure 2 GSTe2 polymorphism and DDT resistance. (A) Maximum likelihood tree of GSTe2 cDNA across Africa. (B) Same as in (A), but with 
DDT-resistant (AL) and DDT-susceptible (DE) mosquitoes in Benin (genomic DNA). (C) GSTe2 haplotype network (TCS) between susceptible and 
resistant mosquitoes (Benin). The polygon size reflects the haplotype frequency. Each node represents a mutation (number). (D) GSTe2 expression 
profile of the three L1 19F genotypes in Cameroon (Gounougou). BN, Benin; DDT, dichlorodiphenyltrichloroethane; MAL, Malawi; UG, Uganda; 
ZB; Zambia. 



in the field. Although a similar trend was observed for the 
pyrethroids permethrin and lambda-cyhalothrin, the cor- 
relation between L119F and resistance to these insecti- 
cides was not significant (P = 0.08 for permethrin, P = 0.1 
for lambda-cyhalothrin) (Additional file 5: Figure S2C,D). 
Hence, the cross-resistance to pyrethroids may be primar- 
ily conferred by the quantitative change in GSTe2 expres- 
sion rather than the qualitative change from the L119F 
mutation. 

In contrast, DDT resistance was more closely associ- 
ated with the L119F mutation than the level of GSTe2 
over-expression (Figure 2D). Indeed, despite a lower 
fold-change in the susceptible LI 19 genotype, no signifi- 
cant difference (P > 0.05) in the GSTe2 expression level 
was observed between the homozygote resistant RR 
(FC = 26.6), the heterozygote RS (FC = 23) and the 
homozygote susceptible SS (FC = 16.5) (Figure 2D). 
These results suggest that the L119F mutation could be 
the predominant cause of DDT resistance. 



Geographical distribution of L119F across Africa 

Genotyping 30 mosquitoes from each of nine African 
countries revealed that the geographical distribution of 
the 119 F resistant allele strongly correlates with the 
distribution of DDT resistance across Africa [19,20] 
(Figure 3C; Additional file 5: Figure S2B). The 119 F 
resistant allele is fixed in highly DDT-resistant Benin 
mosquitoes but completely absent in fully susceptible 
southern African populations [5,21,22]. It also occurs in 
other DDT-resistant populations in West Africa with a 
frequency of 48.2%, 44.2% and 25%, respectively, in 
Cameroon, Ghana and Burkina Faso, in correlation with 
the previously reported prevalence of DDT resistance 
in these countries [19,20]. The resistant 119 F allele 
has also been detected in the eastern African countries 
of Uganda (20.4%) and Kenya (7.8%) but with lower fre- 
quencies, reflecting the moderate level of DDT resis- 
tance that was previously reported [9]. The L119F 
distribution in Africa is similar to that of the A296S 
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Malawi 

Mozambique 
Uganda 

Benin 



Figure 3 L119F association with DDT resistance and GSTe2 variability across Africa. (A) Evidence of the strong correlation between 
L1 19F genotypes and DDT resistance in Gounougou (Cameroon) (25 resistant and 25 susceptible). (B) Differential distribution of L1 19F genotypes 
between DDT-resistant and DDT-susceptible mosquitoes. (C) The geographical distribution of L1 19F across Africa correlates with the DDT 
resistance distribution. (D) GSTe2 haplotype network (TCS) (coding regions only) across Africa, with resistant (grey) and susceptible (maroon) 
clades defined by L1 19F. Each haplotype is represented by a circular or rectangular shape and the size proportional to its frequency in the 
sample while mutational steps are represented by line numbers. (E) Genetic distances between six African populations based on GSTe2 sequences 
{Nst estimates). DDT, dichlorodiphenyltrichloroethane. 



resistance-to-dieldrin (RDL) mutation, which confers 
dieldrin resistance [20] and probably reflects contempor- 
ary patterns of gene flow between An. funestus popula- 
tions across Africa. Indeed, as for L119F, no resistance 
allele was found for RDL in southern Africa in line with 
the full susceptibility to dieldrin in this region. Similarly, 
as for L119F, only a low frequency of RDL was detected 
in Uganda in east Africa. 

Metabolic assays with heterologous GSTe2 enzyme 

An assessment of the DDT dehydrochlorinase activity of 
the recombinant 119 F resistant enzyme (Benin) and 
that of susceptible LI 19 revealed that the 119 F allele is 
3.4 times more efficient at metabolizing DDT (82.9% 
DDT depletion after 1 hr reaction in the presence of the 
cofactor glutathione (GSH); P< 0.001) than the L119 



allele (24.4% depletion) (Figure ID and Additional file 6: 
Figure S3 A). This is confirmed by their respective kinetic 
parameters (Figure IE), with the 119 F allele having a 
higher catalytic efficiency (kcat//<T m ratio) for DDT 
(316.3 [iM^.s- 1 vs 92.0 uJVTV 1 ) (Table 2). 

Additionally, significant permethrin metabolism was 
observed for the Benin 119 F-GSTe2 allele (45% deple- 
tion, P = 0.016) (Figure IF; Additional file 6: Figure S3B 
and S3C) with the high-performance liquid chromatog- 
raphy (HPLC) metabolism profile showing three poten- 
tial metabolite peaks. This suggests that GSTe2 confers 
permethrin resistance by directly metabolizing this in- 
secticide. Similarly, recent research has shown that a 
glutathione S -transferase (GST) from Culex pipiens 
(CpGSTDl) was able to metabolize pyrethroid-like fluor- 
escent substrates directly, including a permethrin-like 
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Table 2 Kinetic parameters of the resistant (119 F) and susceptible (L1 19) GSTe2 alleles 





l/ max (DDE umol.min ^mg n ) 


K m (uM DDT) 


Kcat (s _1 ) 


Kcat/K m (uM _1 .s _1 ) 


119 F-GSTe2 


60.8 ± 7.2 


149.6 ±39.6 


4731 2 ±5297 


316.3 


L119-GSTe2 


3.75 ± 0.26 


34 + 5.9 


3 129 ±222.6 


92.0 



Three independent assays were performed for both alleles and the results show mean ± standard deviation. 



substrate [23]. The nature of the permethrin metabolites 
corresponding to the three peaks observed in this study 
remains unknown. Three potential metabolic pathways 
have been proposed for GST metabolism of pyrethroid- 
like substrates including halogen substitution, Michael 
addition, and thiolysis [23]. Because none of the three 
metabolite peaks in this study matched the permethrin 
ester hydrolysis products, phenoxybenzyl alcohol or phe- 
noxybenzoic acid (produced by the Michael addition and 
the thiolysis routes), it is likely that a GSH conjugate to 
permethrin through the halogen substitution metabolic 
route could be the main mechanism. However, further 
investigations are needed to determine the nature of 
these metabolites. But it is possible that GSTe2 could 
also confer permethrin resistance through other mecha- 
nisms such as sequestration [14] or protection against 
oxidative stress and lipid peroxidation [15]. 

In contrast, no significant metabolism was observed 
for deltamethrin, which is in line with the low delta- 
methrin resistance observed in the field in Pahou (88% 
mortality for deltamethrin vs 66% for permethrin), but 
also the lower deltamethrin resistance observed in trans- 
genic UAS-GSTe2 Drosophila flies. 

Genetic diversity of GSTe2 across Africa 
Signature of positive selection on GSTe2 in Benin 

Analysis of the 882-bp full-length sequence of GSTe2 
(663 bp for the three exons and 219 bp for the two in- 
trons and part of the 3 ' UTR) from field-collected mos- 
quitoes from six countries revealed that GSTe2 is under 
strong directional selection in Benin in contrast to other 
countries (Table 3 and Additional file 7: Table S3, 
Additional file 8: Figure S4A). Indeed, the significant re- 
duction of genetic diversity of GSTe2 in Benin is 



apparent with only one polymorphic site observed in a 
single mosquito out of 12 from Pahou, in contrast to the 
seven to nineteen polymorphic sites in the other popula- 
tions. Only two haplotypes were present in Pahou mos- 
quitoes (the minor haplotype was only present as a 
heterozygote in one out of twelve individuals), in con- 
trast to the seven to ten haplotypes for the other coun- 
tries (Additional file 8: Figure S4A; Table 3). Genetic 
diversity parameters such as haplotype diversity (hd) and 
nucleotide diversity for Benin are significantly lower 
than for the other five countries (Additional file 8: 
Figure S4A). This reduced diversity of GSTe2 for Benin 
mosquitoes suggests the presence of a selective sweep 
across and around this gene. A future assessment will 
establish the extent of this selective sweep in this gen- 
omic region on chromosome 2 L. 

However, other tests of selection such as the McDo- 
nald and Kreitman (MK), Hudson-Kreitman-Aguade 
(HKA), dNIdS and KJK S ratios and Tajimas D tests did 
not show any signature for positive selection in Benin 
(Additional file 7: Table S3). This could be due to the 
fact that the sweep around GSTe2 in the Benin popula- 
tion is nearing fixation as shown by the 100% frequency 
of the resistant 119 F allele in this population. In a situ- 
ation of near fixation of the selective sweep, directional 
selection is better shown by reduced levels of genetic 
variation [24]. Evidence of selection through reduced 
genetic diversity was not observed in Cameroon and 
Ghana, where DDT resistance has also been detected, al- 
though not at the high level as in Benin. However, the 
presence of the predominant Benin-resistant haplotype 
in these west African countries, although at a lower fre- 
quency, could result from the combined effect of local 
DDT selection and migration. 



Table 3 Genetic parameters between coding and non-coding regions for all three genes in the three countries 



Complete sequenced fragment (882 bp) Coding region (663 bp) 



Non-coding region (219 bp) 



Gene 


2n 


S 


h 


hd 


TT 


e 


D 


D* 


S 


h 


hd 




e 


D 


D* 


S 


h 


TTnc 


D 


D* 


Benin 


24 


1 


2 


0.08 


0.09 


0.3 


-1.15 ns 


-1.6 ns 


1 


2 


0.08 


0. 13 


0.4 


-1.15 ns 


-1.6 ns 


0 


1 


0 






Cameroon 


14 


7 


8 


0.89 


2.9 


2.5 


0.6 ns 


0.21 ns 


5 


6 


0.77 


2.4 


2.4 


0.12 ns 


-0.22 ns 


2 


3 


4.2 


1.3 ns 


0.93 ns 


Ghana 


16 


19 


10 


0.86 


5.4 


6.8 


-0.8 ns 


-0.38 ns 


4 


5 


0.67 


2.0 


1.8 


0.35 ns 


0.25 ns 


15 


9 


16 


-1.1 ns 


-0.55 ns 


Mozambique 


10 


12 


8 


0.95 


3.6 


5.2 


-1.4 ns 


-1.4 ns 


2 


3 


0.37 


0.6 


1.1 


-1.4 ns 


-1.6 ns 


10 


8 


13 


-1.3 ns 


-1.25 ns 


Malawi 


18 


14 


7 


0.78 


4.4 


4.6 


-0.14 ns 


1.14 ns 


4 


5 


0.71 


1.6 


1.7 


-0.28 ns 


0.21 ns 


10 


6 


13 


-0.05 ns 


1.41* 


Uganda 


10 


17 


9 


0.97 


7.5 


6.8 


0.5 ns 


0.15 ns 


3 


4 


0.71 


1.7 


1.6 


-0.76 ns 


-1.12 ns 


14 


7 


26 


0.72 ns 


0.41 ns 


Total 


92 


37 


39 


0.86 


5.1 


8.7 


-1.3 ns 


-0.33 ns 


13 


16 


0.73 


2.3 


3.8 


-1.07 ns 


-0.8 ns 


24 


29 


0.013 


-1.3 ns 


-0.06 ns 



2n, number of sequences; D, Tajima's statistics; D*, Fu and Li's statistics; h, number of haplotypes; hd, haplotype diversity; ns, not significant; n, nucleotide diversity 
multiplied by 10 3 ; S, number of polymorphic sites; 0, Watterson's estimator (per site) multiplied by 10 3 . 



Riveron et al. Genome Biology 2014, 15:R27 
http://genomebiology.com/201 4/1 5/2/R27 



Page 9 of 20 



Directional selection has previously been observed in 
this species in southern African pyrethroid-resistant 
populations at the two tandemly duplicated P450 genes 
CYP6P9a and CYP6P9b [5]. Another P450, CYP6G1, 
which confers DDT resistance in D. melanogaster, is also 
under strong directional selection, with a single resistant 
haplotype containing an Accord transposable element in 
the 5' UTR region distributed globally [16]. The same 
CYP6G1 gene is also under directional selection in D. 
simulans, with a 100-kb region having extensive reduced 
heterozygosity due to a selective sweep around CYP6G1 
[25]. These observations further suggest that insecticide 
resistance is an excellent example of natural selection 
at work. As expected, the non-coding regions exhibit 
greater genetic diversity than the coding regions in all 
countries, with more polymorphic sites and haplotypes 
apart from the Benin population, where the unique poly- 
morphic site is in the coding region (Table 3). 

Haplotype distribution of GSTe2 across Africa 

A total of 39 haplotypes were observed for the full gene 
(Additional file 8: Figure S4B), 16 for coding regions 
only (Additional file 8: Figure S4C), and there were 
seven protein variants (Additional file 8: Figure S4D) 
across Africa. Resistant haplotypes (with the 119 F resist- 
ant allele) resolving around a predominant haplotype 
BN23 (nearly fixed in Benin and only present in DDT- 
resistant locations) all had reduced diversity, which is 
indicative of a recent selection with fewer haplotypes 
(11 out of 39), and more homogeneity (less mutational 
step differences <4). The susceptible haplotypes (with 
the LI 19 allele) resolved around a predominant haplo- 
type MAL3 and were more diverse, with more haplo- 
types (29 out of 39) and greater heterogeneity (up to 13 
mutational step differences). Overall, the haplotype dis- 
tribution of GSTe2 across Africa (Additional file 8: 
Figure S4B,C,D) reveals that the L119F mutation is the 
main polymorphism shaping GSTe2 genetic diversity 
with the mutational step at this allele consistently defin- 
ing a resistant and a susceptible group of haplotypes 
(Figure 3D and Additional file 9: Figure S5). 

Population substructure of GSTe2 across Africa 

Analysis of the genetic diversity of GSTe2 indicated that 
An. funestus populations are clearly structured according 
to their pattern of DDT resistance and by geographical 
distance. The construction of a maximum likelihood 
phylogenetic tree of GSTe2 sequences revealed that 
Cameroon and Ghana mosquitoes cluster with Benin 
mosquitoes (Figure 4), correlating with DDT-resistance 
profiles. Susceptible populations from Mozambique and 
Malawi have the lowest genetic differentiation (K st = 
0.016) (Additional file 10: Table S4; Figure 3E). The 
moderately resistant east Africa Ugandan population has 



an intermediate differentiation to all populations. This 
pattern of gene flow correlates with the genetic structure 
patterns of An. funestus populations across Africa based 
on microsatellite markers [26], suggesting that there are 
barriers to gene flow between African An. funestus pop- 
ulations, which will affect the spread of insecticide- 
resistance genes. 

Structural basis of DDT resistance conferred by GSTe2 

To identify the structural changes responsible for the 
higher activity in the resistant GSTe2 Benin allele (119 F), 
its X-ray three-dimensional structure was determined and 
analyzed in comparison with that of a L119-GSTe2 suscep- 
tible allele cloned from Uganda (UG-GSTe2) (Figure 5A,B). 
The topology of GSTe2 indicates that the 221 amino acids 
of the An. funestus GSTe2 (afGSTe2) are divided into two 
distinct domains similar to that of An. gambiae [27] (92% 
similarity between the two species, with a difference of 20 
amino acids). The N-terminal domain and a C-terminal 
domain are connected by a short hinge loop called the 
linker (amino acids 80-89) (Figure 5C). The N-terminal do- 
main (1-79) has the typical TRX-fold (papappa) domain in 
which the central four-stranded mixed p-sheet (B1-B4) is 
flanked on one side by helices HI and H3 and on the other 
side by H2. The C-terminal domain (90-221) is a five 
a-helix bundle (H4-H8) in which the long a-helix H4 is no- 
ticeably bent [27]. The active site is located in a deep cleft 
formed at the interface of the two domains, specifically by 
the interaction of HI and H3 from the N-terminal domain 
and H4, H6 and H8 from the C-terminal domain. 

Overall, the structure of BN-GSTe2 and UG-GSTe2 are 
very similar (Figure 5C, Additional file 11: Figure S6). 
However, significant local differences can be observed 
between the alleles at the C-terminal ends of H4 (contain- 
ing the L119F mutation; residues from 113 to 128) and 
H8 (residues from 213 to 220) (Figure 6A). These differ- 
ences can be described as a concerted movement of the 
BN-GSTe2 H4 and H8 helices with respect to those of 
UG-GSTe2, which opens the active site cleft. The root 
mean squared deviation (RMSD) between the structures is 
0.64 A for all Ca atoms. There is a significant decrease of 
the RMSD between the alleles from 0.64 to 0.42 A when 
excluding H4 (containing the 119 F mutation) and the 
RMSD is only 0.33 A when excluding both H4 and H8. A 
higher RMSD is observed when the two alleles are com- 
pared only for the H4 helix (RMSD = 0.9 A) (Figure 6A). 
Furthermore, the significant difference between the BN- 
GSTe2 and UG-GSTe2 alleles is further compounded by 
the observation that the UG-GSTe2 allele structure is 
more similar to the structure of the An. gambiae GSTe2 
protein [PDB:2imi] (RMSD = 0.4 A) than to Benin-GSTe2 
(RMSD = 0.64 A) despite a 20-amino-acid difference 
(92% similarity) (Figure 6B), indicating that the L119F 
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Figure 4 Maximum likelihood phylogenetic tree of GSTe2 (coding and non-coding regions) across Africa. The analysis involved 92 
sequences (2n) labeled. All positions with gaps and missing data were eliminated. There were a total of 881 positions in the final dataset. A resistant 
clade that is less polymorphic and dominated by the Benin population is clearly identifiable to the right of the tree whereas a susceptible clade that 
is more polymorphic and more cosmopolitan is at the left of the tree. BN, Benin; CAM, Cameroon; GH, Ghana; MAL, Malawi; MOZ, Mozambique; UG, 
Uganda. 



mutation is the key factor that modifies the GSTe2 struc- 
ture (Figure 6C). 

The active site of GSTe2 can be divided into two sub- 
sites. Firstly, there is the glutathione binding site, called 
the G-site, where one GSH molecule is bound. Secondly, 
there is the H-site, which recognizes the hydrophobic 
substrate. The architecture of the G-site of this super- 
family of proteins is well described and nearly identical 
to that observed in BN-GSTe2 and UG-GSTe2. Al- 
though slight variations were observed between the 
G-sites of both alleles, there is no significant conform- 
ational difference, and these G -binding pockets do not 
have any major rearrangement (Additional file 11: Figure 
S6). The H-site is a large and slightly open hydrophobic 
cavity adjacent to the G-site. It is built of residues from 
different loops at the N-terminal domain and noticeably 
includes the variable C-terminal ends of helices H4 and 
H8 at the C-terminal domain (Figure 5C). Consequently, 



the differences observed between BN-GSTe2 and UG- 
GSTe2 mainly affect the structure of the substrate- 
binding pocket (H-site) (Figure 6A). 

Putative DDT-binding pocket (H-site) and the DDT 
metabolism reaction 

Leull9 normally forms part of the solvent-inaccessible 
hydrophobic core that stabilizes the conformation of 
helices H4, H5 and H8 of Uganda-GSTe2 (Figure 6A). 
However, to accommodate the bulkier Phell9 side chain 
in the Benin-GSTe2 structure, the N-terminal end of H4 
has a dramatic bend, which increases the size of the 
H-site in comparison to the UG allele, where the cavity 
is tighter (Figure 6C). The H-site surface representation 
of both proteins with a bound GSH confirms that the re- 
sistant BN allele has a larger putative DDT cavity than 
the susceptible UG allele (Figure 6C). 
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Figure 5 GSTe2 purification and X-ray three-dimensional structure. (A) Size-exclusion chromatogram of GSTe2 alleles from Benin (BN), 
Uganda (UG) and Malawi (MAL) alleles. The lines show the absorbance recorded at 280 nm. Molecular-weight markers (Bio-Rad, Hercules, 
CA, US) are indicated in kilodaltons. (B) SDS-PAGE gel of the three purified GSTe2 alleles (26.8 kDa). (C) Topology of GSTe2 showing the C- and 
N-terminals, the GSH binding pocket (G-site) and the substrate-binding pocket (H-site). AU, arbitrary units; BN, Benin; GSH, glutathione; MAL, 
Malawi; MW, molecular weight; UG, Uganda; vol, volume. 



Attempts to co-crystallize the substrate DDT or the 
product DDE with BN-GSTe2 and UG-GSTe2 were 
unsuccessful. Therefore, to investigate the substrate- 
binding properties of both alleles and assess their 
capacity to hydrolyze DDT, a substrate molecule was 
docked into the H-site (Figure 6C). This docking of a 
DDT molecule into the H-site of both alleles demon- 
strated that the Benin-GSTe2 has the appropriate size 



and shape to accommodate DDT better into a close 
to reactive' conformation. This includes the proper 
stabilization of the DDT chloride-phenyl rings and the 
optimal positioning of the DDT Ca, pointing towards 
the thiolate from glutathione (Figure 6C) leading to in- 
creased DDT metabolism. In contrast, the smaller H-site 
of Uganda-GSTe2 does not allow DDT recognition in 
this conformation (Figure 6C) leading to reduced DDT 
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Figure 6 GSTe2 three-dimensional structure variations reveal that L119F is important for DDT resistance. (A) Overlay of Benin-GSTe2 
(purple) and Uganda-GSTe2 (green) structures and a close-up of the H4/H8 helices containing the L1 19F mutation. (B) Overlay of Uganda-GSTe2 
(green) and An. gambiae GSTe2 (grey) structures showing their very similar conformation. (C) Comparison of the substrate-binding pockets of 
Benin-GSTe2 (purple) and Uganda-GSTe2 (green). (Left) The molecular surface representation of the H-site shows that DDT docks well into the 
Benin-GSTe2 binding site but clashes with the Uganda-GSTe2 structure. (Right) Close-up of the effect of L1 19F on the H-site (this mutation bends 
H4, leading to an increase in the DDT-binding cavity in Benin-GSTe2 but not for Uganda-GSTe2). Ct, C-terminal; DDT, dichlorodiphenyltrichloroeth- 
ane; GSH, glutathione; Nt, N-terminal. 



access and metabolism. This difference at the H-site ex- 
plains the increased active site accessibility, the higher 
activity and consequently the high DDT resistance of the 
Benin population compared with other populations such 
as those from Uganda or Malawi. 

A comparison of the structure of GSTe2 for An. funes- 
tus to the structures of AgGSTe2 and agGSTdl-6 clearly 
indicates that the key factors in the high metabolic activ- 
ity of BN-GSTe2 are the H4 helix position and the shape 
and size of the DDT-binding cavity (Additional file 12: 
Figure S7). GSTdl-6, which belongs to the delta class of 
GST, possesses less than 1/350 the DDT-metabolizing 
activity of AgGSTe2 [7] and was previously shown to dif- 
fer from AgGSTe2 mainly at the H4 and H8 helices [27]. 
The H-site surface representation of these four proteins 
shows that the AgGSTdl-6 cavity, although larger than 
those of AfGSTe2s and AgGSTe2, does not have the 
proper shape for docking a DDT molecule inside. This 



explains why it has the lowest DDT-metabolizing activity. 
BN-GSTe2, UG-GSTe2 and agGSTe2 have a V-shaped 
cavity, which will fit DDT. However, BN-GSTe2 has the 
largest binding pocket, which is able to accommodate the 
DDT molecule in a close to reactive' conformation leading 
to the higher resistance it confers to DDT. 

The proteins for the BN -GSTe2 and UG-GSTe2 alleles 
are separated by two amino acid changes, L119F and 
1131 V. However, only L119F is located in the H4 helix 
whereas 113 IV is located in the H5 helix, where no con- 
formational change is observed between the BN and UG 
forms. In addition, the AgGSTe2 form, which is struc- 
turally similar to UG-GSTe2, contains 1131, like the BN 
form. Therefore, this indicates that the 1 13 IV mutation 
does not play a role in the DDT metabolic activity of the 
BN allele and confirms that the molecular basis of the 
high DDT resistance conferred by the BN-GSTe2 allele 
compared with the susceptible allele is solely explained 
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by the significant conformational changes that are in- 
duced by the L119F mutation. This finding is in agree- 
ment with what was previously predicted by [27]. This 
previous study suggested that for the An. gambiae GSTe2 
to be more active against DDT, more conformational 
changes in the DDT-binding pocket were needed to fur- 
ther accommodate a DDT substrate and that such adjust- 
ments should not be on a large scale because the pocket 
was already well shaped. These changes are exactly what 
happened in the Benin form, where a single amino acid 
change induced the adjustment needed to accommodate 
more DDT in this resistant An. funestus strain, conferring 
a high level of DDT resistance in contrast with the UG al- 
lele, where this adjustment is absent. Overall, these struc- 
tural analyses demonstrate that the L119F mutation is the 
causative mutation that confers DDT resistance. 

Conclusions 

This study presents a comprehensive and detailed dissec- 
tion of the genetic, molecular and structural basis of the 
metabolic resistance to an insecticide in a major malaria 
vector. Firstly, we conclusively detected the main gene re- 
sponsible and showed that the resistance is conferred by a 
qualitative and a quantitative change in the DDT metabol- 
izing enzyme (GSTe2). Secondly, we detected, for the first 
time, to our knowledge, for a mosquito species, a molecu- 
lar resistance marker for metabolic resistance and de- 
signed a reliable DNA-based diagnostic assay that can 
accurately detect and map the distribution of resistance 
across Africa. This diagnostic tool will be valuable for vec- 
tor control programs as resistance can be detected at an 
early stage, allowing suitable resistance management strat- 
egies to be implemented. Thirdly, we showed that the geo- 
graphical distribution of DDT resistance, its origin and 
future spread patterns can be established or predicted 
based on the patterns of GSTe2 genetic diversity, which 
are mainly due to the single L119F mutation. 

Materials and methods 

Mosquito collection 

Blood-fed adult female An. funestus mosquitoes resting in- 
doors were collected in houses between 6.00 AM and 
12.00 PM in Pahou (6°23'N, 2°13'E), which is located near 
the Atlantic coast in southern Benin, west Africa. Several 
collections were conducted between July 2009 and April 
2011. Another collection was conducted in Kpome (6°55' 
N, 2°19'E), which is located inland approximately 100 km 
from Pahou, in December 2011. The other samples used 
in this study have been previously described, and the re- 
spective references are provided when the samples are 
mentioned. Mosquito collection and rearing were per- 
formed as described previously [6,9]. Briefly, F 2 adults 
were generated from field-collected female mosquitoes 
(using an egg-forced laying method [9]) and were 



randomly mixed in cages for subsequent experiments. All 
females used for individual oviposition were morphologic- 
ally and molecularly identified as An. funestus ss, as previ- 
ously described [21]. 

Bioassays 

Insecticide susceptibility assays with 4% DDT and 0.75% 
permethrin were conducted using 2- to 5-day-old F 2 adults 
from pooled Fi mosquitoes, as described previously [21]. 

Microarray 

A custom An. funestus microarray chip containing 
44,000 probes (60-mer) (A-MEXP-2245), previously de- 
scribed by Riveron et al. [5], was used to identify the set 
of genes associated with DDT resistance in Benin mos- 
quitoes. Labeled complementary RNA (cRNA) was ob- 
tained from three biological replicates of DDT-resistant 
mosquitoes from Pahou that had been unexposed to in- 
secticide and from susceptible unexposed mosquitoes 
(from the fully susceptible laboratory strain FANG). The 
Pahou mosquitoes were all DDT resistant as no mortal- 
ity was recorded after 1 hr exposure to 4% DDT [6]. 

RNA was extracted from three batches of ten An. 
funestus females that were all 2 to 5 days old from 
the two sample sets using the Picopure RNA isolation 
kit (Arcturus, Applied Biosystems, Carlsbad, CA, USA). 
The quantity and quality of extracted RNA were asses- 
sed using the NanoDrop ND1000 spectrophotometer 
(Thermo Fisher, Waltham, MA, US) and Bioanalyzer 
(Agilent, Santa Clara, CA, USA), respectively. The cRNA 
from each sample was amplified using the Agilent Quick 
Amp labelling Kit (two-color) following the manufac- 
turers protocol. The cRNA from the resistant Pahou 
samples was labelled with cy5 dye whereas the suscep- 
tible strain FANG (S) was labelled with the cy3 dye. The 
cRNA quantity and quality were assessed before labelling 
using the NanoDrop and the Bioanalyzer. Labelled cRNAs 
were hybridized to the arrays for 17 hr at 65°C, according 
to the manufacturers protocol. Five hybridizations were 
conducted by swapping the biological replicates. 

Microarray data were analyzed using Genespring GX 
12.0 software. To identify the differentially expressed 
genes, a twofold- change cutoff and a significance level of 
P < 0.01 with Benjamin-Hochberg correction for mul- 
tiple testing were applied. 

Quantitative RT-PCR 

The genes that were most associated with resistance from 
the microarray analysis were assessed by qRT-PCR to valid- 
ate their expression pattern using three biological replicates 
for Pahou resistant mosquitoes (R) (alive after 24 hr expos- 
ure to DDT), Pahou control mosquitoes (C) (not exposed 
to any insecticide) and susceptible FANG mosquitoes (S). 
The primers are listed in Additional file 13: Table S5. Next, 
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1 ug total RNA from each of the three biological replicates 
from Pahou resistant, Pahou control and susceptible FANG 
mosquitoes was used as the template for cDNA synthesis 
using Superscript III (Invitrogen, Carlsbad, CA, US) with 
oligo-dT20 and RNase H, according to the manufacturers 
instructions. The qRT-PCR amplification was conducted as 
previously described [5]. The relative expression and fold- 
change of each target gene in R and C relative to S was cal- 
culated according to the 2" AACT method, incorporating 
PCR efficiency [28] after normalization with the housekeep- 
ing genes RSP7 (ribosomal protein S7, AGAP010592) and 
actin 5C (AGAP000651). 

Transgenic expression of GSTe2 in Drosophila flies 
Construction of transgenic Drosophila lines 

The full-length GSTe2 gene was amplified from the 
cDNA of the resistant Benin mosquitoes using Phusion 
High-Fidelity DNA Polymerase (Fermentas, Burlington, 
Ontario, Canada) and the following conditions: 1 cycle 
at 95°C for 5 min; 35 cycles of 94°C for 20 s, 57°C for 
30 s and 72°C for 60 s; and 1 cycle at 72°C for 5 min. 
The primers used are listed in Additional file 13: Table S5. 
The PCR products were purified using the QIAquick PCR 
Purification Kit (Qiagen, Valencia, CA, USA) and cloned 
into the pJET1.2/blunt cloning vector using the Clone- 
JET™ PCR Cloning Kit (Fermentas, Burlington, Ontario, 
Canada). Five positive clones from both samples were 
purified with a QIAprep® Miniprep kit (Qiagen, Valencia, 
CA, USA) and sequenced on both strands. After sequence 
analysis, one clone of the GSTe2 that was predominant in 
Benin mosquitoes was selected for constructing the trans- 
genic flies. This clone was re-amplified as described above 
with primers containing restriction sites for Bgttl and 
Xbal. The purified PCR products were digested using the 
Bglll and Xbal enzymes (Fermentas, Burlington, Ontario, 
Canada) , cloned into the pUASattB vector (provided by 
Dr J Bischof, University of Zurich), pre-digested with the 
same restriction enzymes and transformed into JM109 
cells (Promega, Madison, Wi, US). Using the PhiC31 sys- 
tem, clones were injected into the germ-line of D. melano- 
gaster carrying the attP40 docking site on chromosome 

2 (y 1 w 67c23 ; P (CaryP) attP40,l;2) [29]. One transgenic 
line, UAS-GSTe2, was generated and balanced. For the ex- 
pression of the transgene GSTe2, ubiquitous expression 
was obtained in the flies using the Act5C-GAL4 strain 
(y 1 w*; P (Act5C-GAL4-w) El/CyO,l;2) (Bloomington 
Stock Center, IN, USA). 

Confirmation of GSTe2 expression in the transgenic flies by 
quantitative RT-PCR 

To confirm the expression of GSTe2 in the experimental 
groups and the absence of expression in the control 
groups, total RNA was extracted from three pools of five 
flies. The cDNA synthesis was conducted as described 



above. PCR was performed using the synthesized cDNA 
as a template and primers specific to GSTe2 (Additional 
file 13: Table S5). In addition, the relative expression 
levels of the GSTe2 transgene were assessed by qRT- 
PCR in the F x progeny under the Act5C driver and in re- 
spective controls with normalization using the RPL11 
housekeeping gene. 

Drosophila contact bioassays 

Females from F 2 expressing GSTe2 were selected as the 
experimental group for the insecticide bioassays. The 
parental lines and the female progeny from the cross be- 
tween the Act5C-GAL4 females and males that did not 
carry the GSTe2 transgene (but with the same attP40 
background) were used as controls. A comparison of the 
mortality rates between the experimental groups and the 
control groups was used to assess whether GSTe2 was 
conferring resistance. In addition, 2- to 5 -day-old females 
post-eclosion were used in a contact assay with 4% DDT 
and the pyrethroid insecticides permethrin (2%) and delta- 
methrin (0.15%), as described previously [5]. Then 20 to 
25 flies were placed in each vial, and the mortality plus 
knockdown was scored after 1 hr, 2 hr, 3 hr, 6 hr, 12 hr 
and 24 hr of exposure to the insecticide. For all assays, at 
least six replicates were performed. Students £-test was 
used to compare the mortality plus knockdown in the ex- 
perimental group with each control group. 

Analyzing GSTe2 polymorphisms in relation to DDT 
resistance 

Analysis of cDNA polymorphisms 

The full-length cDNA of GSTe2 was cloned and se- 
quenced for DDT-resistant mosquitoes from Benin and 
for DDT-susceptible An. funestus across Africa (Uganda, 
Malawi, Mozambique and Zambia) to detect potential 
mutations that could be associated with DDT resistance. 
The cDNA amplification was performed using the same 
cDNA synthesized for qRT-PCR with the Phusion poly- 
merase, and the product was cloned and sequenced as de- 
scribed above. The primers used are listed in Additional 
file 13: Table S5. 

Polymorphism analysis between susceptible and resistant 
field mosquitoes in Benin 

A further assessment of the correlation of the poly- 
morphism of GSTe2 and DDT resistance was conducted 
by individually amplifying and direct-sequencing the 
genomic full-length sequence of GSTe2 (all exons and 
introns) for six susceptible mosquitoes (dead after 1 hr 
exposure) and six resistant mosquitoes (alive after 1 hr 
exposure) from Kpome because no susceptible mosqui- 
toes were obtained in Pahou. The polymorphic positions 
were detected through a manual analysis of sequence 
traces using BioEdit and as sequence differences in 
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multiple alignments using ClustalW [30]. dnaSP 5.10 
[31] was used to define the haplotype phase (through 
the Phase option) and to assess genetic parameters, such 
as nucleotide diversity tt, haplotype diversity and the D 
and D* selection estimates. A maximum likelihood tree 
of the haplotypes for both cDNA and genomic amplifi- 
cations was constructed using MEGA 5.2 [32], and a 
haplotype network was built using the TCS program 
[33] (95% connection limit, gaps treated as a fifth state) 
to assess the potential connection between haplotypes 
and resistance phenotypes. 

Diagnostic assay and assessment of whether LI 19F 
correlates with DDT resistance 

To genotype the GSTe2 L119F mutation in field popula- 
tions of An. funestus, a custom TaqMan assay was designed 
after repeated failures from pyrosequencing due to the 
many thymine nucleotides (Ts) around the C/T mutation 
site. The primer and reporter sequences are provided in 
Additional file 14: Table S6. The TaqMan reactions were 
performed in a 10- [il final volume containing lx SensiMix 
(Bioline, London, UK), 800 nM of each primer and 200 nM 
of each probe using an Agilent MX3005P machine. The fol- 
lowing cycling conditions were used: 10 min at 95°C, 40 cy- 
cles of 15 s at 92°C and 1 min at 60°C. This assay was used 
to assess the correlation between the genotypes of the 
L119F mutation and DDT-resistant phenotypes. For this 
assay, due to the lack of susceptible mosquitoes from Pahou 
and the few susceptible mosquitoes from Kpome (in Benin), 
An. funestus samples were collected in northern Cameroon 
from Gounougou (9°05'N, 13°40'E), as described above. A 
WHO bioassay for DDT was conducted as described above. 
Then 35 mosquitoes that were dead after 1 hr of 4% DDT 
exposure (susceptible) and 35 alive mosquitoes (resistant) 
from Gounougou were genotyped for the L119F mutation 
using the TaqMan assay. 

L119F correlation with pyrethroid resistance 

To assess the correlation between the genotypes of the 
L119F mutation and pyrethroid-resistant phenotypes, 25 
mosquitoes that were dead after 1 hr of 0.75% permethrin 
(type I pyrethroid) exposure (susceptible) and 25 alive 
mosquitoes (resistant) from Gounougou were genotyped 
for the L119F mutation using the TaqMan assay. The same 
genotyping was conducted for lambda-cyhalothrin (a type 
II pyrethroid). Association between resistance phenotypes 
and the genotypes was assessed by estimating the odds ra- 
tios and the statistical significance based on the Fisher 
exact probability test [34] . 

Contribution of GSTe2 upregulation and the 119 F mutation 
to the resistance phenotype 

To assess whether GSTe2 upregulation and the presence of 
the 119 F mutation are both necessary to confer DDT 



resistance, we compared the expression levels of GSTe2 for 
the three genotypes of the L119F mutation alongside the 
susceptible FANG strain using qRT-PCR. Three batches of 
five mosquitoes from Gounougou were used for homozy- 
gote susceptible samples (C/C; L119/L119), heterozygote 
samples (C/T; LI 19/1 19 F) and homozygote resistant sam- 
ples (T/T; 119 F/119 F) following the protocol described 
above. The genotype of each mosquito was first established 
after a TaqMan assay using gDNA (genomic DNA) ex- 
tracted from the legs, and then mosquitoes from the same 
genotype were pooled for RNA extraction and cDNA 
synthesis. 

Geographical distribution of the L119F mutation across 
Africa 

To assess the geographical distribution of the L119F mu- 
tation across Africa, 30 field-collected An. funestus ss fe- 
males from nine countries belonging to different regions 
of Africa were genotyped by TaqMan: Benin (Pahou, 
collected in 2010-2011), Ghana (Obuasi, 2009), Burkina 
(Bobo-Dioualasso, 2010) and Cameroon (Gounougou, 
2006) in west-central Africa; Uganda (Tororo, 2009) and 
Kenya (Kisumu, 2012) in east Africa; and Malawi 
(Chikwawa, 2009), Zambia (Katete district, 2011) and 
Mozambique (Chokwe, 2009) in southern Africa. 

Population structure of GSTe2 across African An. funestus 
populations 

To assess the patterns of genetic variability of GSTe2 
across African populations of An. funestus in relation to 
DDT resistance and to detect the potential signatures of 
selection on this gene, full-length GSTe2 (exons and in- 
trons) was amplified and directly sequenced for 10-15 
field-collected female mosquitoes from six countries 
from different regions of Africa. These countries are 
Benin, Ghana and Cameroon in west-central Africa, 
Uganda in east Africa and Mozambique and Malawi in 
southern Africa. The patterns of genetic variability were 
analyzed as described above using dnaSP 5.10 [31]. The 
levels of pairwise genetic differentiation between the 
populations were estimated in dnaSP 5.10 using the K st 
statistic [35], although F st and N st estimates were also 
obtained for comparison. The significance of the K s *es- 
timates was assessed by permutation of subpopulation 
identities and re-calculating K s * 10,000 times, as imple- 
mented in dnaSP 5.10. 

Phylogenetic tree of GSTe2 haplotypes 

A maximum likelihood phylogenetic tree for the coding 
sequences of GSTe2 across Africa was constructed using 
MEGA 5.2 [32]. The best-fit substitution model was 
firstly assessed based on the Bayesian information criter- 
ion. This indicated that the Jukes- Cantor model best de- 
scribes the GSTe2 haplotype dataset out of 24 candidate 
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models. The Jukes-Cantor model was then used to gen- 
erate the maximum likelihood tree as implemented in 
MEGA with 500 bootstrap replications to assess the ro- 
bustness of the tree. Additionally, a haplotype network 
was built for both the full-length region (non-coding 
plus coding) and the coding region only, using the TCS 
program [33] (95% connection limit, gaps treated as a 
fifth state), to assess the potential connection between 
haplotypes and resistance phenotypes. 

Test of selection on GSTe2 

To test for positive selection acting on GSTe2 in relation 
to DDT resistance, several tests were carried out. Be- 
cause reduced levels of genetic variation are an indica- 
tion of positive selection particularly when there is a 
sweep through population nearing fixation [24], several 
genetic diversity estimates were computed using dnaSP 
5.10 and compared between the six populations. These 
parameters included the nucleotide diversity (ji), the 
haplotype diversity and 0, an estimate of 4N e p, with N e 
the effective population size and (A the mutation rate per 
nucleotide. Standard deviation estimates were computed 
by dnaSP 5.10. 

Departure from neutrality was also tested using differ- 
ent selection tests such as the HKA test and the MK test 
as implemented in dnaSP 5.10 using the GSTe2 se- 
quence from An. gambiae (AGAP009194) for the out- 
group. Possible evidence of positive selection was also 
investigated using the KJK S ratio (non-synonymous 
substitution rate/ synonymous substitution rate) with 
K a I K s > 1 indicating positive selection, K a I K s < 1 im- 
plying purifying selection and K a I K s =l suggesting 
neutrality [36]. Additionally, a codon-based Z test of se- 
lection was carried out to assess further the signature of 
positive selection of GSTe2 in resistant samples. This 
test uses the Nei and Gojobori method to compute the 
numbers of synonymous (dS) and non-synonymous (dN) 
substitutions per site and the numbers of potentially 
synonymous and potentially non-synonymous sites [37] 
as implemented in MEGA 5.2. The dN/dS ratio was cal- 
culated and the probability of rejecting the null hypoth- 
esis of strict neutrality (HO: dN=dS) in favor of the 
alternative positive selection hypothesis (HI: dN>dS) 
was estimated using the bootstrap method (1,000 repli- 
cates) in MEGA 5.2. 

GSTe2 protein expression and purification 

A resistant GSTe2 allele (119 F-GSTe2) from Benin (BN) 
and two susceptible alleles (L119-GSTe2) from Uganda 
(UG) and Malawi (MAL) were cloned into a pET28a 
expression plasmid between Ndel and Xhol sites to yield 
pET28a::BN-GSTe2, pET28a::UG-GSTe2 and pET28a:: 
MAL-GSTe2 constructs. The pET28a::BN-GSTe2, pET2 
8a::UG-GSTe2 and pET28a::MAL-GSTe2 plasmids were 



transformed into Escherichia coli strain BL21 (DE3) 
(Novagen, Madison, WI, US) for protein expression 
using standard protocols. A total of 5 ml of an overnight 
culture was sub-cultured into 500 ml of fresh 2TY broth 
medium plus kanamycin (50 (ig/ml). The transformed 
cells were grown at 37°C. GSTe2 expression was then in- 
duced with 0.3 mM of isopropyl-p-D-thiogalactoside 
when the OD (optical density) at 600 nm was 0.6 to 0.8 
overnight at 16°C. The cells were harvested by centrifu- 
gation (15 min, 4,500 g); resuspended in 25 mM TrisHCl 
pH 8.0, 500 mM NaCl, 20 mM imidazole and 5 mM 
p-mercaptoethanol; and disrupted by sonication. After 
centrifugation (40 min, 40,000 g), the clear supernatant 
was filtered, and the His-tagged GSTe2s was purified 
using Ni-NTA agarose (Qiagen,Valencia, CA, US) ac- 
cording to the manufacturer s instructions. The filtered 
supernatant was mixed with the previously equilibrated 
beads. After incubation, a washing step with ten volumes 
of 25 mM TrisHCl pH 8.0, 500 mM NaCl, 20 mM imid- 
azole and 5 mM p-mercaptoethanol buffer was per- 
formed. All constructs yielded 26.8 kDa products. After 
a full dialysis against 25 mM TrisHCl pH 8.0, 200 mM 
NaCl and 5 mM p-mercaptoethanol, the His-tag was 
cleaved using 7.5 units of thrombin per mg of tagged 
protein. A final purification step was performed using 
a Superdex 200 16/60 column (Amersham Biosciences 
Limited, London, UK) to obtain a highly purified sample 
(Figure 5A). The time courses of the chromatography 
were monitored by SDS-PAGE (Figure 5B). GSTe2 pro- 
teins were concentrated to 23 mg/ml with a 10-kDa 
cutoff Amicon protein concentrator (YM-10; Millipore 
Corporation, Bedford, MA, USA). The final protein con- 
centration was determined spectrophotometrically using 
the calculated molar absorption coefficient at 280 nm. 
The samples were kept at 4°C. 

Metabolic assay to assess the effect of GSTe2 on DDT, 
permethrin and deltamethrin 

The activity of GST was determined with a spectro- 
photometric assay to examine the formation of the 
conjugate of l-chloro-2,4-dinitrobenzene (CDNB) and 
reduced GSH. One unit of enzyme is defined as the 
amount of enzyme that yields 1.0 umol of conjugate 
product per minute at pH 6.5 and 30°C. Metabolism 
assays were conducted at 30°C for 60 min with shak- 
ing at 1,200 rpm, in a total volume of 0.5 ml. The 
buffer system was 0.1 M potassium phosphate buffer 
(pH 6.5), 2.5 mM GSH and 0.2 units of enzyme in 
the presence of 10 (ig/ml DDT, 0.025 mg/ml per- 
methrin or 0.03 mg/ml deltamethrin all in methanol 
(the solvent did not exceed 10% of the reaction total 
volume). The control sample contained the same re- 
agent mixture with the boiled recombinant enzyme. 
After 1 hr of incubation, 500 ul of methanol was 
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added to stop the reaction and the samples were then 
centrifuged at 13,000 rpm for 20 min at room 
temperature and 200 \i\ of the resulting supernatant 
were transferred to HPLC vials. The quantity of DDT, 
DDE and pyrethroid remaining in the samples was 
determined by reverse-phase HPLC with a monitoring 
absorbance wavelength of 232 nm (Chromeleon, Dio- 
nex, Sunnyvale, CA, US). Briefly, 100 \A of sample 
was injected into a 250 mm C18 column (Acclaim 
120, Dionex, Sunnyvale, CA, US) at 23°C. Separation 
of DDT and DDE was achieved using an isocratic 
mobile phase of 92% methanol and 8% water with a 
flow rate of 1 ml/min. Kinetic studies were conducted 
as previously described [8]. The results were analyzed 
by non-linear regression using GraphPad Prism v4.0 
software (GraphPad Software, Inc, San Diego, CA, 
USA). 

Additional GSTe2 metabolism of permethrin by 
gradient run 

To better resolve the metabolite peaks observed for per- 
methrin with the initial isocratic run, a further analysis 
of the metabolism profile of permethrin by GSTe2 was 
carried out using a gradient run condition. The parent 
compound and its metabolites were separated on an CI 8 
Acclaim column by injecting 100 \A of reaction products 
reconstituted in 1 ml methanol after an ethyl acetate ex- 
traction and evaporation step. The reaction mixture was 
2 ml with 10 (ig/ml permethrin and 0.2 unit of GSTe2. 
The reaction was initiated by adding 2.5 mM GSH. A 
bovine serum incubation mixture was used as a negative 
control. The mobile phases used were the solvent aceto- 
nitrile (A), methanol (B) and H 2 0 (C). The analytes were 
eluted with the following gradient programs (linear in- 
crease): 0 min (0% A, 5% B, 95% C), 15 min (37% A, 
5% B, 58% C), 25 min (60% A, 5% B, 35% C), 50 min 
(85% A, 5% B, 10% C), 51 min (95% A, 5% B, 0% C), 
56 min (95% A, 5% B, 0% C), 61 min (0% A, 5% B, 
95% C) and 69 min (0% A, 5% B, 95% C), at a flow rate 
of 1 ml/min. Peaks were detected at 232 nm. Data col- 
lection and analysis were conducted using Chromeleon 
software. 

Determination of GSTe2 structure and docking of DDT 
alleles 

Crystallization 

GSTe2 from Benin, Uganda and Malawi mosquitoes was 
purified as described in the supplementary information 
for the metabolic assays. The GSTe2 initial crystalliza- 
tion conditions were investigated by high-throughput 
techniques with a NanoDrop robot (Innovadyne Tech- 
nologies Inc., Santa Rosa, CA, US ) using the commer- 
cial screen solutions Crystal Screen 1 and 2 (Hampton 
Research, Aliso Viejo, CA, US), PACT Suite and JCSG 



Suite (Qiagen,Valencia, CA, US). Crystallization assays 
were conducted using the sitting-drop vapor-diffusion 
method at 18°C in 96-well plates (Innovaplate SD-2 
microplates, Innovadyne Technologies Inc., Santa Rosa, 
CA, US ). Drops of 250 nl protein at 23 mg/ml and 
250 nl precipitant solution were mixed and equilibrated 
against 65 \A of the well solution. Preliminary crystalliza- 
tion conditions led to crystal clusters of thin plates. 
Several strategies were used to optimize the best 
crystallization conditions, which included adjusting the 
protein sample composition, the precipitant concentra- 
tion and pH values, and screening with different addi- 
tives (Additive Screen, Hampton Research, Aliso Viejo, 
CA, US) or detergents. The final conditions were scaled 
up on 24-well plates (Linbro plates, Hampton Research, 
Aliso Viejo, CA, US) through hanging-drop experiments 
and on a 60- well microbath under oil (Terasaki plates) 
at 18°C. 

The crystals used in our analysis grew from a mix of 
1 [A of the protein solution at 23 mg/ml and 2 ul of a 
precipitant solution. After the refinement of several pa- 
rameters, isolated prismatic and rod-shaped crystals 
were obtained. BN-GSTe2 was crystallized by hanging- 
drop vapor-diffusion using a solution as precipitant con- 
taining 40% (v/v) PEG 300 and 0.1 M phosphate-citrate 
pH 4.2. However, UG-GSTe2 and MAL-GSTe2 were 
crystallized using a microbatch under oil technique with 
precipitant 25% w/v PEG 1500 and 0.1 M PCB (Propion- 
ate, Cacodylate, Bis-Tris Propane) buffer pH 6.0, and 
25% w/v PEG 1500 and 0.1 M MMT (L-Malic acid, MES, 
Tris)buffer pH 5.0, respectively. To grow holo_GSTe2 
crystals, 0.5 \A of 10 mM GSH (L-glutathione reduced)/ 
GSSG (L-glutathione oxidized) from Hampton Research 
were added to the crystallization drops. Several cryopro- 
tectants were tested, including glycerol, MPD (2-Met- 
hyl-2,4-pentanediol) and polyethylene glycol. The best 
cryoprotectant solution was 20% glycerol on the crystal 
mother liquor. 

Data collection and structure resolution 

Crystals were mounted on a fiber loop, transferred to 
the cryoprotectant solution and flash-frozen at 100 K in 
a nitrogen gas steam. Preliminary diffraction data were 
collected using an in-house Imaging Plate Mar345dtb 
detector (MarResearch, Norderstedt, Germany) with Cu 
K a X-rays generated by a rotating-anode generator 
(MicroStar, Bruker, Billerica, MA, US) with Helios mir- 
rors (Bruker, Billerica, MA, US) operated at 45 kV and 
60 mA. The apo_UG-GSTe2 and holo_MAL-GSTe2 
crystals were not suitable for X-ray data analysis because 
their diffraction was very poor. However, the crystals of 
apo_BN-GSTe2, holo_BN-GSTe2 and holo_UG-GSTe2 
had good diffraction patterns. A complete diffraction 
dataset was collected for each using the European 
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Synchrotron Radiation Facility (Grenoble, France) (see 
details in Additional file 15: Table S7). Diffraction data 
were processed with XDS (X-ray Detector Software) [38] 
and scaled with SCALA from the CCP4 package (Col- 
laborative Computational Project, Number 4, 1994). 
Molecular replacement with the program Phaser [39] 
was used to resolve the GSTe2 structures. The coordi- 
nates from the An. gambiae glutathione S-transferase 
epsilon 2 (agGSTe2) [PDB:2IL3] (92% sequence identity 
[27]) were used to resolve the apo_BN-GSTe2 and 
holo_UG-GSTe2 structures. The holo_BN-GSTe2 struc- 
ture was resolved using the holo_UG-GSTe2 coordi- 
nates. Several cycles of restrained refinement with 
PHENIX [40] and iterative model building with COOT 
were required to obtain the final models. The water mol- 
ecules were also modeled. The data collection, data pro- 
cessing and model refinement statistics are summarized 
in Additional file 15: Table S7. 

The stereochemistry of the models was verified with 
MolProbity, and the molecular model figures were pro- 
duced using PyMol (The PyMOL Molecular Graphics 
System, Version 1.5.0.4 Schrodinger, LLC, Portland, OR, 
US). The RMSDs between the proteins structures were 
calculated in COOT. The DALI algorithm was used for 
structure-based sequence alignment of BN-GSTe2 and 
UG-GSTe2 with other insect GSTs. 



Docking 

Docking of DDT to the different alleles of GSTe2 was 
performed using the Genetic Optimization for Ligand 
Docking (GOLD) software from the Cambridge Crystal- 
lography Data Center, UK. Both DDT and DDE were 
used in the docking calculations. The 3D structures of 
these molecules were obtained from the Cambridge 
Structural Database with reference codes CPTCEL and 
DCLPEY, respectively. The stereochemistry of the li- 
gands was confirmed with the Mercury program. Both 
holo_BN-GSTe2 and holo_UG-GSTe2 coordinates were 
used to define the binding site for molecular docking 
studies. All the solvent molecules in the structures were 
removed, and hydrogen atoms were added to the whole 
protein. In addition, hydrogen atoms were added to the 
cofactor molecule and cysteine SH group was deproto- 
nated to obtain GS" for the docking calculations. The 
docking cavity was defined as a collection of amino acids 
enclosed within a sphere with a 10 A radius around the 
GS" molecule, giving freedom of movement to F118, 
R112, E116, L119 or F119, F120, T165 and L207 and 
more restrained flexibility to Mill and F115 in the side- 
chain rotamers. The standard default settings were used 
in all calculations (number of dockings: 10), but early 
termination was allowed when the top three solutions 
were within 1.5 A RMSD from each other. 



Data access 

The DNA sequences reported in this paper have been de- 
posited in the GenBank database [GenBank:KC800340- 
GenBank:KC800421], the microarray data in ArrayExpress 
(E-MTAB-1578) and the 3D X-ray structures in the PDB 
database (BN-GSTe2-GSH [PDB:3zmk] and UG-GSTe2- 
GSH [PDB:3zml]). 

Additional files 



Additional file 1: Figure SI. Transcription profiling and functional 
analyses of GSTe2. (A) Volcano plot showing the differential expression 
pattern between DDT-resistant Benin mosquitoes and the susceptible 
FANG strain, with a twofold-change cutoff and P< 0.01. GSTe2 is highlighted 
as one of the most upregulated genes in Benin mosquitoes. (B) qRT-PCR 
validation of microarray upregulation of the main detoxification genes 
that were differentially expressed between resistant and susceptible DDT 
samples. c738 is a short-chain dehydrogenase (combined_c738) that is 
upregulated according to the microarray. (C) The relative expression of the 
GSTe2 transgene in the transgenic D. melanogaster Act5C-GSTe2 strain and 
the control sample with no transgene expression. The data shown are the 
mean ± standard error of the mean (n = 3). (D) Deltamethrin bioassay tests 
on transgenic Act5C-G57e2 flies (Exp-GSTe2) and control strains (two paren- 
tal (UAS-GSTe2 and GAL4-Actin) and F] progeny that do not express the 
GSTe2 transgene (Cont-NO)). 

Additional file 2: Table SI. Downregulated genes in the DDT-resistant 
Benin population of An. funestus. 

Additional file 3: Table S2. Genetic variability parameters for GSTe2 for 
resistant (alive) and susceptible (dead) mosquitoes fromKpome (Benin). 

Additional file 4: A PDF document containing supplemental results 
and discussion. 

Additional file 5: Figure S2. Detection of the 1 19 F GSTe2 resistance 
allele in An. funestus. (A) The results of the TaqMan diagnostic assay for 
genotyping L1 19F, with three genotypes unambiguously identified 
(three clusters). (B) The genotype distribution of L1 19F alleles across nine 
countries in Africa shows there is a strong correlation between L1 19F 
genotypes and known patterns of DDT resistance. For example, the 1 19 F 
(T/T) genotype is fixed in the highly DDT-resistant Benin population but 
is completely absent in the fully susceptible southern African populations 
(Malawi, Mozambique and Zambia). (C) Assessment of the correlation 
between the L1 19F alleles and the permethrin (type I pyrethroid)-resistant 
phenotype. (D) Assessment of the correlation between the L119F alleles 
and the lambda-cyhalothrin (type II pyrethroid)-resistant phenotype. 

Additional file 6: Figure S3. Metabolic activity of GSTe2. (A) DDT 
metabolism by GSTe2 from Benin has a high peak for the DDE metabolite 
product. (B) There is a reduction in the peak for permethrin metabolism by 
119 F-GSTe2 in the reaction with GSTe2 (blue) compared to the control 
(black) (three replicates) using isocratic conditions. The arrow indicates 
potential metabolites. (C) Additional permethrin metabolism by the Benin 
GSTe2 enzyme using gradient conditions. Blue peaks refer to the metabolism 
profile of active GSTe2 whereas black peaks refer to the bovine serum 
incubation mixture (negative control). The arrows indicate the three potential 
metabolites. The analytes were eluted with a linear increase gradient 
program. 

Additional file 7: Table S3. Selection parameters for G5Te2 across Africa. 

Additional file 8: Figure S4. Polymorphism patterns of GSTe2 in Africa. 
(A) Plot of genetic diversity parameters of GSTe2 across Africa that indicates 
there is strong directional selection of GSJe2 in Benin mosquitoes, hd, 
haplotype diversity; n, nucleotide diversity. (B) Haplotypes of GSTe2 (coding 
and non-coding) across six countries in Africa with contrasting DDT pheno- 
types. The polymorphic positions are indicated with numbers above the 
nucleotide, and the haplotypes are labeled from 1 to 39 with preceding 
initials from the country where the haplotype is predominant. An asterisk (*) 
shows that the haplotype was observed in other countries. N is the number 
of individuals who share the haplotype. (C) The same but only considering 
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coding regions. (D) The same but only considering non-synonymous 
substitutions providing the different protein variants of GSTe2 across Africa. 

Additional file 9: Figure S5. Haplotype network for the full-length 
GSJe2 (coding and non-coding regions) across Africa. The frequency 
of each haplotype is reflected by the size of its polygon. Each node 
represents a segregating mutation (the polymorphic position is given 
above the branches). The polygons in grey are resistant haplotypes that 
harbor the resistant 1 19 F allele whereas the haplotypes in white are 
susceptible. The mutational position of 499 is circled to indicate that this 
polymorphic position, which corresponds to L1 19F, is key to shaping the 
GSJe2 haplotype distribution. 

Additional file 10: Table S4. Genetic differentiation using K st . 

Additional file 11: Figure S6. The conformations of the GSH binding 
pocket (G-site) of the resistant BN-GSTe2 allele (purple) and the suscep- 
tible UG-GSTe2 allele (green) have no significant differences. The GSH 
cofactor and the protein amino acids and water molecules hydrogen- 
bonded to it are represented in ball-and-stick mode. 

Additional file 12: Figure S7. Comparative analysis of the H-site 
structure of different GSTe2 alleles and agGSTd1-6. The size and shape 
of the site are key determinants of DDT metabolism capacity. (Top) The 
molecular surface representations of those residues constituting the 
H-site of Uganda-GSTe2 (green), Benin-GSTe2 (purple), agGSTe2 (white) 
and agGSTd1-6 (yellow). Oxygen and nitrogen atoms at the surface are 
depicted in red and blue, respectively. (Bottom) Overlays of the structures 
of Uganda-GSTe2 (green) with Benin-GSTe2 (purple), agGSTe2 (white) 
and agGSTd1-6 (yellow) showing that most differences are in the 
secondary structural elements related to the H-site. 

Additional file 13: Table S5. List of primers used in this study. 

Additional file 14: Table S6. Primers for TaqMan L1 19F GSJe2 assay. 

Additional file 15: Table S7. Summary statistics for data collection, 
processing and model refinement. 
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